Factorization of integers and arithmetic functions 

LINCOLN DURST 
ABSTRACT 

Elementary proofs of unique factorization in rings of 
arithmetic functions using a simple variant of Euclid's 
proof for the fundamental theorem of arithmetic. 

1. Introduction. 

In The Elements [11, BOOKS VII and ix] Euclid proved that, 
except for the order in which the factors are written, positive integers 
greater than 1 can be expressed uniquely as a product of irreducible 
integers p. 

Euclid's argument has two main components: 

(1) Each composite integer ab, with a > 1, b > 1, has an irreducible 
factor p, i.e., a factor whose only divisors are 1 and p. [BOOK 
VII, PROPOSITIONS 31, 32] 

(2) Every irreducible integer is a prime, i.e., if p is irreducible, ab > 
1, and p | ab, then p \ a or p \ b. [VII, 20 and IX, 14] 

Euclid or Euclid's translators introduced the expression prime 
number to represent what may be referred to in terminology of twen- 
tieth century algebra as an irreducible positive integer. Euclid de- 
serves full credit, of course, for recognizing that irreducible positive 
integers have the more important property described in (2). 

The need for a distinction between irreducibles and primes arose 
early in the nineteenth century when an impasse was encountered 
while attempting to prove Fermat's Last Theorem, since Euclid's (2) 
fails to hold in some rings where (1) holds, especially in cases involving 
real or complex algebraic numbers. See Edwards [10], Hardy and 
Wright [12], and Jacobson [14]. 

2. Euclid's original arguments. 

For Euclid all integers are positive — and ordered, indeed well or- 
dered, meaning no sequence of positive integers in which each member 
is less than its predecessor can be an infinite sequence, but must ter- 
minate after finitely many terms. 

Here we are interested in factorization in a family of rings due 
to Liouville that are not ordered. First we modify Euclid's method, 
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using simple algebraic concepts, to reduce its dependence on the order 
relation Euclid used. Our variant of Euclid's (1) allows us to preserve 
Euclid's (2). Unlike the Fermat case, our procedure allows us to use 
only rational integers. In this sense our argument remains entirely 
elementary. 

Euclid begins BOOK VII with a list of definitions, the relevant 
part here being that numbers larger than one must be irreducible or 
composite. The former have only trivial, i.e., unit, proper divisors; 
the latter have nonunit proper divisors. 

Euclid's argument for property (1): If a and b are more than 
one, ab is composite. If a is irreducible, then ab has an irreducible 
factor. Otherwise a is composite, so some c divides a. If c is composite 
we repeat the previous step and begin generating a chain of proper 
divisors of ab. We must encounter an irreducible factor, for the only 
alternative is to generate an infinite sequence of divisors, each a proper 
divisor of its predecessor, and that is impossible. [11, vol. 2, p. 332] 

Euclid has two proofs for (2): FIRST. If p \ ab, then pc = ab and 
p : b = a : c, i.e., p/b = a/c. If p does not divide b, the only common 
divisor of p and b is 1, so the ratio p : b is in lowest terms and m > 1 
exists with a = rap, c = mb, hence p \ a. [11, vol. 2, p. 321] 

SECOND. Let a be the least number divisible by all the irre- 
ducibles pi,P2, • • • ,Pk- Suppose q is an irreducible distinct from each 
of pi,P2, ■ ■ ■ ,Pk, and assume q divides a. Then a = qm. Since q does 
not equal any of p\,pi ■ ■ ■ ,Pk, all the pi must divide m. Because m 
is a proper divisor of a, this contradicts the minimal property of a. 
[11, vol. 2, p. 402] 

Repeated use of (1) replaces each composite factor in a prod- 
uct by a product of irreducible factors. Thus one representation as a 
product of irreducibles is possible. Two such representations, involv- 
ing different sets of irreducibles are not possible, as (2) shows. Note 
that one irreducible positive integer divides another only if they are 
equal since the only divisors of each are 1 and itself. 

3. A variant of Euclid's argument. 

A set is said to be linearly ordered, or to be a chain, if, given 
any pair, a, b, of distinct members, a ^ b, one is greater than the 
other: a < b or b < a. A set is said to be well ordered if it is linearly 
ordered and if each of its nonvoid subsets contains a least element: 
one less than all other members of the subset. In addition to the set 
of positive integers, every finite linearly ordered set is well ordered. 
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No set of rational numbers, with their usual order, that contains 
the reciprocals of all the positive integers is well ordered. Examples: 
all the rationals, the real numbers, and complex numbers. 

The argument we consider here involves a few simple concepts 
from twentieth century algebra: rings, semigroups, partially ordered 
sets, and chains of divisors. 

If / is the ring of integers, /' the set of nonzero integers, and S 
the set of positive integers, the last two are examples of semigroups 
whose products are associative and commutative, and each contains 
the identity element 1 for the products in /. Since I is an integral 
domain, these semigroups are cancellative: xy = xz implies y = z. 

In any ring or semigroup, we define a divides 6, or a | 6, to mean 
the ring or semigroup contains a member c satisfying ac = b. 

The difference between a group and a semigroup is the former 
contains the reciprocal of each of its members. In the two semigroups 
here, only 1, in S, and only ±1, in divides every member of the 
semigroup, including the identity element. Members of rings or semi- 
groups with this property are called units. 

Possibly more esoteric than things mentioned so far, although 
surely simple as required here, are the MacKenzie co-ideals in a semi- 
group [15]: 

DEFINITION. If S is a commutative semigroup and a G S , the subset 
{a} of S containing all divisors of a is called a co-ideal of S . 

The relation a \ b is reflexive, since a \ a for all a G S, and transi- 
tive since a \ b and b \ c imply a | c, for all a, 6, c G S; thus the relation 
a | b shares two of the four defining properties of a 'weak' linear order 
relation, a < b. In the special case of the positive integers, it is also 
antisymmetric, since for each a, 6, 

a | b and b \ a imply a = 6, 
which corresponds to the linear order property 

a < b and b < a imply a = b. 

The only other property of a linear order relation that a \ b lacks in 
general is, given any a,b G S, 

either a < b or b < a, if a ^ b. 

A co-ideal {a}, is a finite subset of S, partially ordered by the 
divisor relation x\y in S, that we may also write as x < y using 
customary notation for a partial order relation. 



3 



As partially ordered sets, co-ideals can be represented as the 
union of chains by a theorem of Dilworth [6, theorem 1.1]; for the 
finite case, sufficient here, see Tverberg [18]. 

Each x in such a chain is a proper divisor, x < y, of each y above 
it in the chain. These chains, being finite linearly ordered sets are 
well ordered, and S is said to satisfy the divisor chain condition. 

In terminology for partially ordered sets, a is the universal (great- 
est) element of the co-ideal {a} generated by a, and the identity el- 
ement, 1, for products in the ring containing S is the null (least) 
element of {a}. In the following example, a is a square free inte- 
ger and the co-ideal {a} is a boolean algebra. In less special cases, 
co-ideals can have more complicated structures. [Cf. § 10, below.] 

EXAMPLE: The co-ideal of the divisors of 30 has a total of 2 3 members 
and is the union of the following three chains. 

1, 2, 6, 30; 1, 3, 15, 30; 1, 5, 10, 30. 

The atoms of a co-ideal, i. e., members covering only the null element, 
are irreducible elements of S since 1 is their only proper divisor. 

In a partially ordered set if either a < b or b < a, a and b are 
called a comparable pair, otherwise a and b are called noncomparable. 
According to Dilworth's theorem involved here, a partially ordered 
set P can be decomposed into the union of k chains provided each 
subset of P with k + 1 elements contains a comparable pair. 

The atoms in {a} have the maximal number of noncomparable 
elements for {a} and the number of chains is the same as the number 
of atoms. 

Euclid need not have assumed well ordering of the positive inte- 
gers themselves was essential for their factorization into products of 
irreducibles. Here we have found well ordering where it is needed, in 
chains of divisors of integers, which is just where Euclid used it. 

Euclid's criterion (2), or something effectively equivalent to it, 
is essential for uniqueness of the representation of positive integers 
as products of irreducibles and is what rules out the possibility of 
obtaining distinct products that are equal, say, 

P1P2 • ■ -Pk = qiQ2 • ■ • qi, Pi,qj irreducibles, 
where the intersection of the set of ps and the set of gs is empty. 
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4. Liouville rings of arithmetic functions. (F) 

If F is the field of rational, real, or complex numbers, we consider 
rings of arithmetic functions, whose elements are sequences of mem- 
bers of F indexed by positive integers, and prove unique factorization 
for this family of rings. In § 9 we consider sequences of integers. 

Let R be a commutative and associative ring defined as follows, 
see Dickson [8], Bell [2], and Apostol [1]: 

(1) The elements of R are all sequences, indexed by the positive 
integers, whose members are elements of F. We write the sequences 
in the form a(n) where n G S, a being an arbitrary function mapping 
the positive integers into the field F. 

(2) Addition in R is defined simply as (a + (3)(n) = a(n) + (3(n). 

(3) The zero of R, i.e., the identity element for addition, is the 
constant sequence u(n) = 0, for all n £ S. 

(4) Products in R are defined by the finite sums 

(a*p)(n) = <*(d)W), 

d5=n 

where d, 5 run through the positive integers satisfying dS — n. 

(5) The identity element for products is the sequence defined by 

e(n) = 1 for n = 1 and e(n) = for n > 1. 

Each R is an integral domain: if a, (3 ^ u>, then a, b exist with 
a(a)(3(b) ^ 0, and a(n)/3(m) = for n < a, m < b; hence (a*(3)(ab) = 
a (a) (3(b) ^ 0, i.e., products of nonzero elements are not zero. 

Familiar members of each of these rings are Euler's (^-function, 
the number of positive divisors of n, the sum of those divisors, etc. 

The units in these rings are the divisors of e. To find all units we 
suppose a G R, (a * a')(n) = 1 for n = 1, (a * a')(n) = for n > 1, 
and solve for a'. Now with a(l) G F, and 

a(l)a'(l) = 1, a(l)a'(2) + a(2)a'(l) = 0, a(l)a'(3) + ■ ■ ■ = 0, . . . , 

we must assume a(l) ^ 0; this infinite system of equations can then 
be solved recursively by induction for o/(l), o/(2), o/(3), — The 
solution is unique since the coefficient matrix of the first n equations 
is triangular and all elements on the diagonal are a(l). See Bell [3]. 

Thus all functions a with a(l) ^ are units in R, and only 
these. Observe that the sequences a in R that are units in R have 
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initial terms oj(1) that are units in the underlying field F: every 
nonzero element d in a field is a unit since it divides 1, its co factor 
being its reciprocal: dd~ l = 1. In R, all multiplicative functions have 
a(l) = 1. Therefore the most familiar members of R, being units, are 
of negligible interest when it comes to unique factorization. 

Consider the nonzero nonunits, ct(l) = 0, in R; more precisely, 
for some a > 1, a(n) = for n < a and a(a) ^ 0. If a and (3 are such 
nonunits, (a * (3)(l) = a(l)(3(l) = 0. Also (a * (3)(p) = a(l)(3(p) + 
a(p)P(l) = for every prime number p. CONTRAPOSITIVE: Every 
nonzero nonunit that is not zero for one or more prime numbers p 
cannot be expressed as a product of two nonzero nonunits; it is an 
irreducible element. 

Among irreducible elements of R: the characteristic function x 
of any set of prime numbers, i. e., x(n) is 1 if n prime, zero otherwise; 
(7r(?i)) 2 or, if F is the real field, logn, log(n!), log(n n ), etc. 

Let us call a the rank of a if for a > 1, a(a) ^ 0, and a(n) = 
when n < a. If (3 has rank 6, then the rank of a * f3 is the least n for 
which (a * b)(n) ^ 0. Since a{a)(3(b) ^ 0, and a(d)(3{8) = if d < a 
or 5 < 6, it follows that 

(a*P)(ab)= a(d) (3(8) = a(a) (3(b) 

dS=ab 

and the rank of the Liouville product a * (3 is ab. 

If we extend this definition of rank so that units have rank 1, 
it follows that the rank of a product a * (3, for a, (3 nonzero, is the 
product of the ranks of its factors. Except for the zero element uj, 
every sequence in R has finite rank. 

Suppose a and (3 have the same rank q, that is a(q) ^ 0, (3(q) ^ 0, 
and a(r) — (3(r) — for 1 < r < q. It follows that a, (3 differ by a 
unit factor: assume a = (3 * 7. By hypothesis 

a(q) = J2 P( r M s ) = MWq) + "• + )%h(l) = ]%h(l)- 

rs=q 

But a(q) 7^ and (3(q) 7^ imply 7(1) 7^ 0. If g is a prime, a and /? 
are irreducible, q being the first prime for which both are not zero. 

5. Co-ideals in Liouville rings. (F) 

In Liouville rings we define division and divisors as before: a(n) 
divides (3(n) means a * 7 = (3 for some 'j(n) G R. In these cases we 
write T for the semigroup of nonzero members of R. 
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Suppose {a} is the co-ideal of an arbitrary member a(n) of T; 
when a, (3 e T, if then {a} C {/?}, and conversely. If {a} C {/?}, 
we call a a proper divisor of [3. If ck j /3 and /3 | a, then {a} = {/?} and 
conversely. In the latter case, we say a and (3 are associates and for 
this relation we write a ~ j3. 

If ct(n) is a unit, {a} contains all the units in T, since every unit, 
by virtue of dividing every member of T, divides a. 

Clearly if 8 is a unit and if a * 8 = (3, then a\(3 and, since 8 is a 
unit, we also have (3\a because (3 * 8' = a where 8' is the reciprocal 
of 8; i. e., 8 * 8' = s. Thus if a and (3 differ by a unit factor, a ~ /3. 

The relation a ~ /3 is at the very least an equivalence relation, 
i.e., it is reflexive, transitive, and symmetric: 

Reflexivity: a = e * a (e is a unit). 

Transitivity: a = 8\ * f3 and /3 = 82 * 7 imply a = 8\ * 82 * 7 
(Si * 82 is a unit, if $i and $2 are units). 

Symmetry: a ~ (3 implies (3 ~ a: if 5 is a unit and a * 6 = f3, 
then /3 * 5' = a if 5 * 5' = £. 

Congruence: «i ~ «2 and /3i ~ $2 imply «i * (3\ ~ «2 * P2, since 
«i = 5i * ct2 and Pi = 82* 02 imply a.\ * (3i = (81 * 82) * ct2 * (3i- 

With the last property included, the equivalence relation shares an 
additional well-known property with congruence relations. 

For r > 1 let v r [n) = 1 if n = r and v r [n) = if n 7^ r. Since the 
functions in each class of associates have the same rank, each class 
contains one of the functions u r ; and if p and q are equal or distinct 
positive integers, v v *v q = v vq , i.e., [y v * v q )(n) = v vq {ri) for n > 1. 

6. Reduced semigroups of Liouville rings. (F) 

The elements of T are all the nonzero members of R, includ- 
ing, along with each member ct, all of its associates. Let a be the 
equivalence class whose members are a and its associates. Thus the 
semigroup T is the union of all the classes a for a G T, and if a and 
(3 are not associates, the intersection a fl (3 is empty. 

For simplicity, we resort to a homomorphism, i. e., a many-to-one 
mapping from a larger set to a smaller set; we need one that maps 
Liouville products in T into Liouville products in the collection T of 
its equivalence classes. Thus we can transfer attention to a semigroup 
whose members are the equivalence classes of T. We define Liouville 
products in the semigroup T of equivalence classes as follows. 
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Given classes a and /?, we take one element from each class, say 
ai G a and (3\ G /?, and form their product cx\ * 

According to the congruence-condition in the previous section, 
the product cx\ * j3\ is in the same class with every other product of 
the form ct 2 * P2 where «2 is any member of a, and P2 is any member 
of p. Since the class containing these products depends only on the 
classes used and not on the members multiplied, we define 

a * (3 to be a* (3. 

By shifting attention from T to T we obtain a major simplification: 
note, for one example, the equivalence class containing all the units 
of T is the identity element for the Liouville product in T. 

T is called the reduced semigroup of T. 

7. Isomorphism of the reduced semigroups S and T. (F) 

We have seen the associates of a all have the same rank. So 
there is a one-one correspondence between classes of associates and 
the common rank of their members: a <-> a, a G T, a G S. 

The co-ideals {a} and {a} are isomorphic as partially ordered 
sets, i.e., if a in the latter co-ideal is paired with its rank a in the 
former co-ideal, the divisor relations correspond since /3 | a in T if and 
only if b | a in S. Therefore the partially ordered sets {a} and {57} are 
identical, except for the alphabets used to describe them. 

8. Factorization into irreducibles and its uniqueness. (F) 

For T we argue as for S, with the result that factorization into 
finitely many irreducible equivalence classes is possible and unique. 

Each irreducible equivalence class can be represented by an ar- 
bitrary member of the class and alternate representatives may be 
chosen if in each instance a compensating unit factor is introduced as 
well; moreover, no matter how many alternate choices are made, the 
resulting product of units is itself a unit. 

Thus each composite function a, can be written as the product 
of a single unit and finitely many irreducible factors. 

Hence factorization of the elements of R takes the form 

5 Hi 7r 2 ■ • -n k , 

where 5 is a unit and tti, tt2, ■ ■ ■ , 717. are irreducible members of R, a 
representation that is unique, except for the order of the factors and 
replacement by associates of the factors present. 
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9. Special Case: The subring of sequences of integers. (I) 

The first proof for unique factorization in a ring of arithmetic 
functions was carried out for the field of complex numbers, a special 
case of the family of fields (F) described above in § 4; see Cashwell and 
Everett [4]. These authors conjectured unique factorization should 
also hold for the ring of integer valued arithmetic functions and proved 
it soon after [5]. In both cases F and I their methods were transfinite. 

We now consider the case of the Liouville ring of arithmetic func- 
tions with integral values and encounter no significant differences from 
our elementary proofs when the functions have values in a field. 

For this special case, we observe first that integers form a sub- 
ring of each field F considered before, and the ring of sequences of 
integers is a subring of each of the corresponding rings R. The argu- 
ment presented in sections 4 through 8 can be carried out simply by 
restricting attention to sequences of integers. 

There are two differences between the cases based on F and I. 

One is the paucity of units: for sequences of integers units have 
o(l) = ±1 instead of o(l) 7^ as for fields. This difference is negligi- 
ble, the homomorphism simply wipes it out. 

The difference in what qualifies as a unit entails the other dif- 
ference: there are more nonunits, indeed there is a new family of 
irreducible elements. Because the defining condition for units now is 
|o(l)| = 1 instead of a(l) ^ 0, the two kinds of nonunits are distin- 
guished since they satisfy either a(l) = or |o(l)| > 2. 

The case ot(l) = is common to the earlier cases and the integral 
case, and presents no novelty. 

The case |o(l)| > 2 is even simpler. Observe that if a(n) = 
(3(n) * 7(n) for all n > 1 and if |o(l)| = then 7(1) = ±1 

since |o(l)| = \f3(l) \ ■ |7(1)|- These nonunit sequences have rank one, 
in the terminology of the previous cases, and their only divisors are 
associates and units; therefore they are irreducibles. 

In spite of these differences, the variant of Euclid's argument goes 
through as for the fields F considered earlier. 

10. Summary of criteria for unique factorization. 

The Euclidean algorithm (Elements, VII, 1,2), often considered 
in this context, is obtained by repeated subtraction, which is easy 
for positive integers because they are (well) ordered and the division 
algorithm is available; but not in R. 
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Gauss argues that if a prime number divides neither a nor b, it 
cannot divide their product; this is a version of Euclid's condition (2). 
Gauss proves it [13, II, 14, 15] using congruences, also based on division 
with remainder and difficult to achieve without an order relation. 

Dirichlet, in his lectures [9, §1.8] avoids GCDs and the division 
algorithm, and argues as follows. If p is irreducible, its only divisors 
are 1 and p, so p shares no other divisors with any number. Dirichlet's 
observation implies Gauss's version of Euclid's criterion (2), as an 
immediate consequence of the definition of irreducibles. 

For the cases considered here the essential hypothesis is the di- 
visor chain condition, and for this have co-ideals. 

For proofs in / and R the following hypotheses are superfluous: 
the division algorithm, the Euclidean algorithm, Euclid's (2), and 
existence of GCDs. The more relevant of these come from the co- 
ideals and the definition of irreducibles. 

Co-ideals are lattices: We claimed the co-ideal of divisors of 30 
is a boolean algebra because 30 has no square divisors. 

A boolean algebra is a distributive lattice with unique comple- 
ments; its characteristic property is that it is isomorphic to the lattice 
of all subsets of some set. The co-ideal {30} is isomorphic to the col- 
lection of all subsets of the set {2,3,5}, including both the empty 
subset and the full set representing the divisors 1 and 30. 

If in a partially ordered set each pair of elements x, y has a 
greatest lower bound and a least upper bound in its partial order 
relation x < y, the partially ordered set is called a lattice; the GLB 
and LUB are called, respectively, the meet and join of x and y and 
are denoted by x A y and x V y. 

If the partial order relation is the divisor relation x \ y, x A y is 
the GCD of x and y and x V y is their LCM. See Polya and Szego [16]. 

A lattice is complemented if (a) it contains a universal element 
and a null element, often denoted by / and O, respectively; and (b) if 
for each member x there is at least one member x' with the properties 
x A x' = O and x V x' = I. (c) If each x has exactly one complement 
x', the lattice is said to have unique complements. 

A lattice is distributive if x V (y A z) = (x V y) A (x V z) holds 
for all its members x, y, z. A necessary and sufficient condition for 
distributivity isxAy = xAz and x\/y = xV ' z imply y = z, Crawley and 
Dilworth [6, p. 21]. If the partial order is the divisor relation x \ y both 
of these assertions are properties of GCDs and LCMs. The necessary 
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and sufficient condition follows immediately from (x,y)[x,y\ = xy 
and also implies that any complements that exist are unique. 

For the special case of the co-ideal {30}, the complementary pairs 
are 1' = 30, 2' = 15, 3' = 10, 5' = 6, and by symmetry, 30' = 1, etc. 
It follows that the co-ideal {30} is a boolean algebra. 

The co-ideal {12} is the union of the following pair of chains 
(1,2,4,12) and (1,3,6,12). Complementary pairs of elements are 
1' = 12, and 3' = 4, but not 2 and 6 since 2 A 6 = 2 and 2 V 6 = 6. 
Here 12 is not square free; it is divisible by the square of 2. For more 
on lattices, see Crawley and Dilworth [6, chapters 1, 2]. 

For earlier studies of Liouville rings see 4, 5, and 7; these papers 
use transfinite methods. The full force of Dilworth's theorem does 
also; it is proved using Zorn's lemma. 

For the relation between arithmetic functions and Dirichlet se- 
ries, see Polya and Szego, [16]. Cashwell and Everett's articles are 
based on the fact that Dirichlet series may be represented as power 
series in infinitely many variables, one variable for each positive prime 
number. In 1913 Harald Bohr published a study of the analytic prop- 
erties of this pair of series representations in Gottinger Nachrichten, 
pages 444-488 [Reprinted as Item A 9, in volume one of Bohr's Col- 
lected Mathematical Works, K0benhavn (1952)]. 

The product * in R is often called Dirichlet convolution: see 
Apostol [1, §2.14] and Popken [17]. 
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